Knowledge base question answering with a matching-aggregation model and question-specific contextual relations
National Research Foundation (NRF) Singapore under the International Research Centres in Singapore Funding Initiative
Multi-level head-wise match and aggregation in transformer for textual sequence matching
Transformer has been successfully applied to many natural language processing
tasks. However, for textual sequence matching, simple matching between the
representations of a pair of sequences may introduce unnecessary noise. In this
paper, we propose a new approach to sequence pair matching with Transformer, by
learning head-wise matching representations on multiple levels. Experiments
show that our proposed approach can achieve new state-of-the-art performance on
multiple tasks that rely only on pre-computed sequence-vector representations,
such as SNLI, MNLI-match, MNLI-mismatch, QQP, and SQuAD-binary.
Comment: AAAI 2020, 8 pages
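As a rough illustration of the head-wise matching idea, here is a minimal PyTorch sketch that splits two pre-computed sequence vectors into heads, compares each head pair element-wise, and aggregates the per-head match vectors for classification. The dimensions, the concat/difference/product comparison features, and the single-level aggregation are illustrative assumptions, not the paper's exact multi-level formulation.

```python
import torch
import torch.nn as nn

class HeadWiseMatcher(nn.Module):
    """Sketch: compare two sequence vectors head by head, then aggregate.

    d_model, num_heads, and the comparison features are assumptions for
    illustration; the paper matches on multiple levels of the Transformer.
    """
    def __init__(self, d_model=768, num_heads=12, num_classes=3):
        super().__init__()
        assert d_model % num_heads == 0
        self.num_heads = num_heads
        self.d_head = d_model // num_heads
        # small comparison network shared across all heads
        self.compare = nn.Sequential(
            nn.Linear(4 * self.d_head, self.d_head), nn.ReLU())
        self.classify = nn.Linear(num_heads * self.d_head, num_classes)

    def forward(self, a, b):
        # a, b: (batch, d_model) pooled vectors for the two sequences
        batch = a.size(0)
        a = a.view(batch, self.num_heads, self.d_head)
        b = b.view(batch, self.num_heads, self.d_head)
        # head-wise match features: both vectors, their difference, product
        feats = torch.cat([a, b, a - b, a * b], dim=-1)
        matched = self.compare(feats)        # (batch, heads, d_head)
        pooled = matched.flatten(1)          # aggregate across heads
        return self.classify(pooled)

# usage with two pre-computed sequence vectors
logits = HeadWiseMatcher()(torch.randn(2, 768), torch.randn(2, 768))
```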
Learning natural language inference with LSTM
Natural language inference (NLI) is a fundamentally important task in natural
language processing that has many applications. The recently released Stanford
Natural Language Inference (SNLI) corpus has made it possible to develop and
evaluate learning-centered methods such as deep neural networks for NLI. In
this paper, we propose a special long short-term
memory (LSTM) architecture for NLI. Our model builds on top of a recently
proposed neural attention model for NLI but is based on a significantly
different idea. Instead of deriving sentence embeddings for the premise and the
hypothesis to be used for classification, our solution uses a match-LSTM to
perform word-by-word matching of the hypothesis with the premise. This LSTM is
able to place more emphasis on important word-level matching results. In
particular, we observe that this LSTM remembers important mismatches that are
critical for predicting the contradiction or the neutral relationship label. On
the SNLI corpus, our model achieves an accuracy of 86.1%, outperforming the
state of the art.
Comment: 10 pages, 2 figures
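To make the word-by-word matching concrete, the following PyTorch sketch attends over the premise for each hypothesis word and feeds the attended premise vector, concatenated with the current hypothesis word, into an LSTM whose final state drives classification. The dot-product attention and the dimensions are simplifying assumptions; the paper parameterizes attention with learned weight matrices.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MatchLSTM(nn.Module):
    """Sketch of match-LSTM word-by-word matching (simplified attention)."""
    def __init__(self, d=300, num_classes=3):
        super().__init__()
        self.cell = nn.LSTMCell(2 * d, d)   # consumes [attended; word]
        self.classify = nn.Linear(d, num_classes)

    def forward(self, premise, hypothesis):
        # premise: (batch, Lp, d), hypothesis: (batch, Lh, d) word encodings
        batch, _, d = premise.shape
        h = premise.new_zeros(batch, d)
        c = premise.new_zeros(batch, d)
        for t in range(hypothesis.size(1)):
            w = hypothesis[:, t]            # current hypothesis word
            # attend over premise words, conditioned on the match state
            scores = torch.bmm(premise, (w + h).unsqueeze(2)).squeeze(2)
            alpha = F.softmax(scores, dim=1)
            attended = torch.bmm(alpha.unsqueeze(1), premise).squeeze(1)
            h, c = self.cell(torch.cat([attended, w], dim=-1), (h, c))
        # the final state summarizes important (mis)matches
        return self.classify(h)

logits = MatchLSTM()(torch.randn(2, 7, 300), torch.randn(2, 5, 300))
```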
A compare-aggregate model for matching text sequences
Many NLP tasks including machine comprehension, answer selection and text
entailment require the comparison between sequences. Matching the important
units between sequences is key to solving these problems. In this paper, we
present a general "compare-aggregate" framework that performs word-level
matching followed by aggregation using Convolutional Neural Networks. We
particularly focus on the different comparison functions we can use to match
two vectors. We use four different datasets to evaluate the model. We find that
some simple comparison functions based on element-wise operations can work
better than standard neural network and neural tensor network models.
Comment: 11 pages, 2 figures
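Below is a minimal PyTorch sketch of the compare-aggregate pattern, using an element-wise subtraction-and-multiplication comparison followed by CNN aggregation; the layer sizes and the dot-product alignment are assumptions for illustration, not the paper's full set of comparison functions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CompareAggregate(nn.Module):
    """Sketch: align, compare element-wise, aggregate with a CNN."""
    def __init__(self, d=300, num_classes=3):
        super().__init__()
        self.compare = nn.Sequential(nn.Linear(2 * d, d), nn.ReLU())
        # CNN aggregation over the per-word comparison vectors
        self.conv = nn.Conv1d(d, d, kernel_size=3, padding=1)
        self.classify = nn.Linear(d, num_classes)

    def forward(self, a, b):
        # a: (batch, La, d), b: (batch, Lb, d) word encodings
        attn = F.softmax(torch.bmm(b, a.transpose(1, 2)), dim=-1)
        h = torch.bmm(attn, a)    # words of a aligned to each word of b
        # element-wise comparison: squared difference and product
        t = self.compare(torch.cat([(b - h) * (b - h), b * h], dim=-1))
        g = F.relu(self.conv(t.transpose(1, 2)))   # aggregate matches
        pooled = g.max(dim=2).values
        return self.classify(pooled)

logits = CompareAggregate()(torch.randn(2, 7, 300), torch.randn(2, 5, 300))
```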